Extracting Web User Profiles Using Relational Competitive Fuzzy Clustering
نویسندگان
چکیده
The proliferation of information on the World Wide Web has made the personalization of this information space a necessity. An important component of Web personalization is to mine typical user pro les from the vast amount of historical data stored in access logs. In the absence of any a priori knowledge, unsupervised classi cation or clustering methods seem to be ideally suited to analyze the semi-structured log data of user accesses. In this paper, we de ne the notion of a \user session" as being a temporally compact sequence of Web accesses by a user. We also de ne a new distance measure between two Web sessions that captures the organization of a Web site. The Competitive Agglomeration clustering algorithm which can automatically cluster data into the optimal number of components is extended so that it can work on relational data. The resulting Competitive Agglomeration for Relational Data (CARD) algorithm can deal with complex, nonEuclidean, distance/similarity measures. This algorithm was used to analyze Web server access logs successfully and obtain typical session pro les of users.
منابع مشابه
Mining Web Access Logs Using Relational Competitive Fuzzy Clustering
The proliferation of information on the World-Wide Web has made the personalization of this information space a necessity. An important part of Web personalization is to mine typical user profiles from the vast amount of historical data stored in access logs. In this paper, we define the notion of a “user session” and a new distance measure between two web sessions that captures the organizatio...
متن کاملWeb User Profiling Using Relational Fuzzy Clustering
User profiling is a fundamental task in Web personalization. In this paper, we use a relational fuzzy clustering to discover user profiles from Web log data. Precisely, a modified version of the CARD algorithm, called CARD+, is proposed to discover clusters embedded in the Web usage data and derive profiles modeling the real user preferences. Experimental results on log data extracted from log ...
متن کاملRelational fuzzy approach for mining user profiles
Capturing the characteristics and preferences of Web users into user profiles is a fundamental task to perform in order to implement forms of personalization on a Web site. In this paper, we present a relational fuzzy clustering approach to extract significant user profiles from session data derived from log files. In particular, a modified version of the CARD clustering algorithm is proposed i...
متن کاملLearning Web Users Profiles With Relational Clustering Algorithms
In the context of web personalization and dynamic content recommendation, it is crucial to learn typical user profiles. Although there exists several approaches to mine user profiles (such as association rules or sequential patterns extraction), this paper focuses on the application of relational clustering algorithms on web usage data to characterize user access profiles. These methods rely on...
متن کاملRough set based User profiling for Web Personalization
Web usage mining has recently emerged as a basis for extracting useful user access pattern information, such as user profiles, from enormous amounts of Web log data for web site personalization. A profile can consist of a set of URLs that are relevant to the sessions assigned to a given cluster. Once these profiles are discovered, they can be exploited as part of an automated personalization on...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- International Journal on Artificial Intelligence Tools
دوره 9 شماره
صفحات -
تاریخ انتشار 2000